Abstract: Clustering plays an important role in data mining, pattern recognition, and machine learning.
Single-valued neutrosophic sets (SVNSs) are a useful means to describe and handle indeterminate
and inconsistent information, which fuzzy sets and intuitionistic fuzzy sets cannot describe or deal
with. To cluster data represented by single-valued neutrosophic information, this article proposes a
single-valued neutrosophic minimum spanning tree (SVNMST) clustering algorithm. First, we define
a generalized distance measure between SVNSs. Then, we present an SVNMST clustering algorithm for
clustering single-valued neutrosophic data based on the generalized distance measure of SVNSs. Finally,
two illustrative examples are given to demonstrate the application and effectiveness of the developed
approach.


Keywords: Neutrosophic set, single-valued neutrosophic set, minimum spanning tree, clustering algorithm, 
generalized distance measure. 


*Corresponding author: Jun Ye, Department of Electrical and Information Engineering, Shaoxing University, 508 Huancheng 
West Road, Shaoxing, Zhejiang Province 312000, P.R. China, Phone: +86-575-88327323, e-mail: yehjun@aliyun.com 


1 Introduction 


Clustering plays an important role in data mining, pattern recognition, and machine learning. Clustering
data sets into disjoint groups is a problem arising in many domains. Generally, the goal of clustering is to
find groups that are both homogeneous and well separated; that is, entities within the same group should
be similar and entities in different groups dissimilar. Data sets can be represented by weighted graphs,
where nodes correspond to the entities to be clustered and edges correspond to a dissimilarity or similarity
measure between those entities. Graph theory [3] provides a very convenient tool to describe clustering
problems. In particular, a minimum spanning tree (MST) is a very useful graph structure that can capture
perceptual grouping [4]. Zahn [13] defined several criteria of edge inconsistency for detecting clusters of
different shapes and proposed a clustering algorithm using the MST. Xu et al. [10] introduced three MST
algorithms and applied them to clustering gene expression data. Owing to the fuzziness and uncertainty of
many practical problems in the real world, Ruspini [7] first presented the concept of fuzzy division and a
fuzzy clustering approach. Dong et al. [2] introduced a hierarchical clustering algorithm based on fuzzy
graph connectedness. Chen et al. [1] put forward a maximal tree clustering method for fuzzy graphs: they
constructed the fuzzy similarity relation matrix, used a threshold on that matrix to cut the maximum
spanning tree, and then obtained the classification at the given level. Zhao et al. [15] proposed two
intuitionistic fuzzy minimum spanning tree (IFMST) clustering algorithms to deal with intuitionistic fuzzy
information and extended them to clustering interval-valued intuitionistic fuzzy information. Furthermore,
Zhang and Xu [14] introduced an MST algorithm-based clustering method under a hesitant fuzzy environment.
To represent uncertain, imprecise, incomplete, and inconsistent information that exists in the real world,
Smarandache [8] gave the concept of a neutrosophic set from a philosophical point of view. The neutrosophic
set [8] is a powerful general formal framework that generalizes the concept of the classic set, fuzzy set,
interval-valued fuzzy set, intuitionistic fuzzy set, interval-valued intuitionistic fuzzy set, paraconsistent set,
dialetheist set, paradoxist set, and tautological set. In the neutrosophic set, truth membership,
indeterminacy membership, and falsity membership are represented independently. However, the neutrosophic set
generalizes the above-mentioned sets from the philosophical point of view, and its functions $T_A(x)$, $I_A(x)$,
and $F_A(x)$ are real standard or nonstandard subsets of $]^{-}0, 1^{+}[$, i.e., $T_A(x): X \to ]^{-}0, 1^{+}[$,
$I_A(x): X \to ]^{-}0, 1^{+}[$, and $F_A(x): X \to ]^{-}0, 1^{+}[$. It is therefore difficult to apply in real
scientific and engineering areas. Thus, Wang et al. [9] introduced a single-valued neutrosophic set (SVNS),
which is an instance of a neutrosophic set.
It can describe and handle indeterminate information and inconsistent information, which fuzzy sets and
intuitionistic fuzzy sets cannot describe or deal with. Recently, Ye [11, 12] presented the correlation
coefficient of SVNSs and the cross-entropy measure of SVNSs and applied them to single-valued neutrosophic
decision-making problems. Until now, however, there has been no study on clustering data represented by
single-valued neutrosophic information, and the existing MST clustering algorithms cannot cluster
single-valued neutrosophic data. Therefore, this article proposes a single-valued neutrosophic minimum
spanning tree (SVNMST) clustering algorithm to deal with data represented by SVNSs. The rest
of the article is organized as follows. Section 2 introduces some basic concepts of SVNSs, the graph, and its
MST. Section 3 defines a generalized distance measure between SVNSs. In Section 4, a single-valued
neutrosophic clustering algorithm is proposed based on the MST. In Section 5, two illustrative examples are given
to demonstrate the applications and the effectiveness of the proposed approach. Conclusions and further
research are contained in Section 6.


2 Preliminaries 


In this section, some basic concepts of SVNSs, the graph, and its MST are introduced to be utilized in the next 
sections. 


2.1 Some Concepts of SVNSs 
Smarandache [8] introduced the concept of a neutrosophic set from a philosophical point of view.


Definition 1 [8]. Let X be a space of points (objects), with a generic element in X denoted by x. A neutrosophic
set A in X is characterized by a truth membership function $T_A(x)$, an indeterminacy membership function
$I_A(x)$, and a falsity membership function $F_A(x)$. The functions $T_A(x)$, $I_A(x)$, and $F_A(x)$ are real standard or
nonstandard subsets of $]^{-}0, 1^{+}[$, that is, $T_A(x): X \to ]^{-}0, 1^{+}[$, $I_A(x): X \to ]^{-}0, 1^{+}[$, and
$F_A(x): X \to ]^{-}0, 1^{+}[$, with the condition $^{-}0 \le \sup T_A(x) + \sup I_A(x) + \sup F_A(x) \le 3^{+}$.

Obviously, the neutrosophic set is difficult to apply in real scientific and engineering fields. Thus, Wang
et al. [9] introduced the concept of an SVNS, which is an instance of a neutrosophic set.


Definition 2 [9]. Let X be a space of points (objects) with generic elements in X denoted by x. An SVNS A in X
is characterized by a truth membership function $T_A(x)$, an indeterminacy membership function $I_A(x)$, and a
falsity membership function $F_A(x)$. For each point x in X, $T_A(x), I_A(x), F_A(x) \in [0, 1]$ and
$0 \le T_A(x) + I_A(x) + F_A(x) \le 3$. Therefore, an SVNS A can be represented by


$$A = \{\langle x, T_A(x), I_A(x), F_A(x)\rangle \mid x \in X\}.$$


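The following relations are defined in [9] for SVNSs A and B:
1. $A \subseteq B$ if and only if $T_A(x) \le T_B(x)$, $I_A(x) \ge I_B(x)$, and $F_A(x) \ge F_B(x)$ for any x in X.
2. $A = B$ if and only if $A \subseteq B$ and $B \subseteq A$.
3. $A^c = \{\langle x, F_A(x), 1 - I_A(x), T_A(x)\rangle \mid x \in X\}$.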
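
The three relations can be illustrated with a minimal Python sketch. It assumes a finite universe and stores an SVNS as a dictionary from element names to (T, I, F) triples; the type alias `SVNS`, the function names, and the sample values are illustrative choices, not part of [9].

```python
from typing import Dict, Tuple

# Assumed representation: element name -> (T, I, F), each value in [0, 1].
SVNS = Dict[str, Tuple[float, float, float]]


def is_valid(a: SVNS) -> bool:
    """T, I, F must lie in [0, 1]; their sum then lies in [0, 3]."""
    return all(0.0 <= v <= 1.0 for triple in a.values() for v in triple)


def is_subset(a: SVNS, b: SVNS) -> bool:
    """Relation 1: T_A <= T_B, I_A >= I_B, F_A >= F_B for every x."""
    return all(
        a[x][0] <= b[x][0] and a[x][1] >= b[x][1] and a[x][2] >= b[x][2]
        for x in a
    )


def is_equal(a: SVNS, b: SVNS) -> bool:
    """Relation 2: A = B iff each set is contained in the other."""
    return is_subset(a, b) and is_subset(b, a)


def complement(a: SVNS) -> SVNS:
    """Relation 3: swap T and F, and replace I by 1 - I."""
    return {x: (f, 1.0 - i, t) for x, (t, i, f) in a.items()}


# Hypothetical sample sets over X = {x1, x2}.
a = {"x1": (0.3, 0.25, 0.5), "x2": (0.2, 0.5, 0.6)}
b = {"x1": (0.6, 0.25, 0.1), "x2": (0.4, 0.5, 0.3)}

print(is_valid(a), is_subset(a, b), is_subset(b, a))
print(complement(a)["x1"])
```

Taking the complement twice returns the original set, which is a quick sanity check on relation 3.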
2.2 Graph and MSTs


A graph G is a pair of sets G = (V, E), where V is the set of vertices (or nodes) and E is the set of edges. There
are two kinds of graphs: undirected and directed. Each edge in an undirected graph is an
unordered pair $\{v_i, v_j\}$, whereas each edge in a directed graph is an ordered pair $(v_i, v_j)$, where the vertices
$v_i$ and $v_j$ are called the end points of the edge. A sequence of edges and vertices that can be traveled between
two different vertices is called a path. For example, a graph with five nodes and seven
edges is shown in Figure 1.

In Figure 1, there are different paths from node B to node E, such as the path (BAE) and the
path (BCDE). A path whose start node and destination node are the same is called a circuit, like
(ABCA) or (ADEA).

A connected graph has paths between any pair of nodes. A connected acyclic graph that contains all
nodes of G is called a spanning tree of the graph, that is, any set of straight line segments connecting pairs
of nodes such that
1. no closed loops occur,

2. each node is visited by at least one line, and
3. the tree is connected.


For example, a spanning tree with integer segment lengths is shown in Figure 2, which is obtained from
Figure 1. If nodes B and C, or D and E, were connected, a closed loop would be formed and the resulting
figure would not be a tree. We define the length of a tree to be the sum of the lengths of its constituent
edges; the length of the tree in Figure 2 is 4 + 6 + 5 + 8 = 23 units.

An MST is a spanning tree of minimum length, which is what is often required. For instance, Figure 3 shows
the MST obtained from Figure 1; its length is 4 + 3 + 2 + 5 = 14 units.

Various algorithms to find the MST have been proposed in [5, 6, 10]. The two most popular ones [5, 6],
which are needed later, find the MST of a graph G iteratively. At any
stage, each segment belongs to one of two sets: set A, containing those segments assigned to the MST, and
set B, containing those not assigned.

The algorithm given by Kruskal [5] iteratively assigns to set A the shortest segment in set B
that does not form a closed loop with any of the segments already in A. Initially, A is empty, and the
iteration stops when A contains (n - 1) segments, where n is the number of nodes.

The algorithm given by Prim [6] starts with any one of the given nodes and initially assigns to A the
shortest segment starting from this node. The procedure continues to add the shortest segment from B that
connects to at least one segment in A without forming a closed loop with the segments already in A. The
iteration stops when A contains (n - 1) segments.
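
Kruskal's procedure can be sketched in Python with a union-find structure that detects closed loops. The edge set below follows the paths and circuits named for Figure 1 (AB, AC, AD, AE, BC, CD, DE), but the weights are assumptions: the figure's actual weights are not recoverable from the text, and these were chosen only so that the totals agree with the lengths 23 and 14 quoted above.

```python
from typing import Dict, List, Tuple

Edge = Tuple[float, str, str]  # (weight, end point, end point)


def kruskal(nodes: List[str], edges: List[Edge]) -> List[Edge]:
    """Return the segments of a minimum spanning tree of a connected graph."""
    parent: Dict[str, str] = {v: v for v in nodes}

    def find(v: str) -> str:
        # Follow parent links to the component representative,
        # shortening the path along the way.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    tree: List[Edge] = []            # set A: segments assigned to the MST
    for w, u, v in sorted(edges):    # set B, scanned from shortest to longest
        ru, rv = find(u), find(v)
        if ru != rv:                 # the segment closes no loop with A
            parent[ru] = rv
            tree.append((w, u, v))
            if len(tree) == len(nodes) - 1:
                break
    return tree


# Hypothetical weights on the seven edges of the five-node graph.
nodes = ["A", "B", "C", "D", "E"]
edges = [(4, "A", "B"), (6, "A", "C"), (5, "A", "D"), (8, "A", "E"),
         (3, "B", "C"), (2, "C", "D"), (5, "D", "E")]

mst = kruskal(nodes, edges)
print(mst, sum(w for w, _, _ in mst))
```

With these assumed weights the tree has total length 14, and the segment (A, D) is rejected because A and D are already connected through B and C.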

Usually, data sets to be clustered can be represented as weighted graphs, where nodes correspond to the
entities to be clustered and edges correspond to a distance measure (also called a dissimilarity measure) between
those entities. If a fuzzy relation R over V x V is defined, then the membership function $\mu_R(v_i, v_j)$, where
$(v_i, v_j) \in V \times V$, takes values from 0 to 1, and such a graph is called a fuzzy graph. When R is an
intuitionistic fuzzy relation over V x V, such a graph is called an intuitionistic fuzzy graph [15].


Figure 1. Graph.


Figure 2. Spanning Tree.


3 Distance Measures of SVNSs 


Consider two SVNSs A and B in a universe of discourse $X = \{x_1, x_2, \ldots, x_n\}$, denoted by
$A = \{\langle x_i, T_A(x_i), I_A(x_i), F_A(x_i)\rangle \mid x_i \in X\}$ and
$B = \{\langle x_i, T_B(x_i), I_B(x_i), F_B(x_i)\rangle \mid x_i \in X\}$, where
$T_A(x_i), I_A(x_i), F_A(x_i), T_B(x_i), I_B(x_i), F_B(x_i) \in [0, 1]$ for every $x_i \in X$.
Let $w_i$ ($i = 1, 2, \ldots, n$) be the weight of the element $x_i$, with $w_i \ge 0$
and $\sum_{i=1}^{n} w_i = 1$. Then, we define the generalized single-valued neutrosophic weighted distance:


$$d_\lambda(A, B) = \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^{\lambda} + |I_A(x_i) - I_B(x_i)|^{\lambda} + |F_A(x_i) - F_B(x_i)|^{\lambda} \right] \right\}^{1/\lambda}, \tag{1}$$


where $\lambda > 0$.
In particular, for $\lambda = 1, 2$, Equation (1) reduces to the single-valued neutrosophic weighted Hamming distance
and the single-valued neutrosophic weighted Euclidean distance, respectively, as follows:


$$d_1(A, B) = \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)| + |I_A(x_i) - I_B(x_i)| + |F_A(x_i) - F_B(x_i)| \right], \tag{2}$$


Figure 3. Minimum Spanning Tree.
$$d_2(A, B) = \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^{2} + |I_A(x_i) - I_B(x_i)|^{2} + |F_A(x_i) - F_B(x_i)|^{2} \right] \right\}^{1/2}. \tag{3}$$
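
As an illustration, the generalized distance can be written as a short Python function. This is a sketch under the assumption that each SVNS is given as a list of (T, I, F) triples aligned with the weight list, and that the sum is normalized by the factor 1/3; the names `svns_distance` and `lam` and the sample values are hypothetical.

```python
from typing import Sequence, Tuple

Triple = Tuple[float, float, float]  # (T, I, F) for one element x_i


def svns_distance(a: Sequence[Triple], b: Sequence[Triple],
                  w: Sequence[float], lam: float = 2.0) -> float:
    """Generalized weighted distance between two SVNSs; lam must be > 0."""
    if lam <= 0:
        raise ValueError("lam must be > 0")
    if not (len(a) == len(b) == len(w)):
        raise ValueError("a, b and w must have the same length")
    total = 0.0
    for (ta, ia, fa), (tb, ib, fb), wi in zip(a, b, w):
        total += wi * (abs(ta - tb) ** lam
                       + abs(ia - ib) ** lam
                       + abs(fa - fb) ** lam)
    return (total / 3.0) ** (1.0 / lam)


# Hypothetical SVNSs over two equally weighted elements.
a = [(1.0, 0.0, 0.0), (0.5, 0.5, 0.5)]
b = [(0.0, 1.0, 1.0), (0.5, 0.5, 0.5)]
w = [0.5, 0.5]

print(svns_distance(a, b, w, lam=1))  # Hamming case
print(svns_distance(a, b, w, lam=2))  # Euclidean case
```

For this pair the first element differs maximally in all three components and the second not at all, so the Hamming case gives 0.5 and the Euclidean case gives the square root of 0.5.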


For this distance measure, we have the following proposition.


Proposition 1. The above-defined distance $d_\lambda(A, B)$ for $\lambda > 0$ satisfies the following properties:
(P1) $d_\lambda(A, B) \ge 0$;
(P2) $d_\lambda(A, B) = 0$ if and only if $A = B$;
(P3) $d_\lambda(A, B) = d_\lambda(B, A)$;
(P4) If $A \subseteq B \subseteq C$, where C is an SVNS in X, then $d_\lambda(A, C) \ge d_\lambda(A, B)$ and $d_\lambda(A, C) \ge d_\lambda(B, C)$.


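Proof. It is easy to see that $d_\lambda(A, B)$ satisfies the properties (P1) to (P3). Therefore, we only prove (P4). Let
$A \subseteq B \subseteq C$; then $T_A(x_i) \le T_B(x_i) \le T_C(x_i)$, $I_A(x_i) \ge I_B(x_i) \ge I_C(x_i)$, and
$F_A(x_i) \ge F_B(x_i) \ge F_C(x_i)$ for every $x_i \in X$. Then, we obtain the following relations:

$$|T_A(x_i) - T_B(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda}, \quad |T_B(x_i) - T_C(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda},$$

$$|I_A(x_i) - I_B(x_i)|^{\lambda} \le |I_A(x_i) - I_C(x_i)|^{\lambda}, \quad |I_B(x_i) - I_C(x_i)|^{\lambda} \le |I_A(x_i) - I_C(x_i)|^{\lambda},$$

$$|F_A(x_i) - F_B(x_i)|^{\lambda} \le |F_A(x_i) - F_C(x_i)|^{\lambda}, \quad |F_B(x_i) - F_C(x_i)|^{\lambda} \le |F_A(x_i) - F_C(x_i)|^{\lambda}.$$

Hence,

$$|T_A(x_i) - T_B(x_i)|^{\lambda} + |I_A(x_i) - I_B(x_i)|^{\lambda} + |F_A(x_i) - F_B(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda} + |I_A(x_i) - I_C(x_i)|^{\lambda} + |F_A(x_i) - F_C(x_i)|^{\lambda},$$

$$|T_B(x_i) - T_C(x_i)|^{\lambda} + |I_B(x_i) - I_C(x_i)|^{\lambda} + |F_B(x_i) - F_C(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda} + |I_A(x_i) - I_C(x_i)|^{\lambda} + |F_A(x_i) - F_C(x_i)|^{\lambda}.$$

Combining the above inequalities with the above-defined distance formula (1), we obtain
$d_\lambda(A, B) \le d_\lambda(A, C)$ and $d_\lambda(B, C) \le d_\lambda(A, C)$ for $\lambda > 0$.


Thus, the property (P4) is obtained. $\square$
In the following, we give the definition of the single-valued neutrosophic distance matrix.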


Definition 3. Let $A_j$ ($j = 1, 2, \ldots, m$) be a collection of m SVNSs; then $D = (d_{ij})_{m \times m}$ is called a single-valued
neutrosophic distance matrix, where $d_{ij} = d_\lambda(A_i, A_j)$ is the distance between $A_i$ and $A_j$, and its properties are as
follows:

1. $0 \le d_{ij} \le 1$ for all $i, j = 1, 2, \ldots, m$;

2. $d_{ij} = 0$ if and only if $A_i = A_j$;

3. $d_{ij} = d_{ji}$ for all $i, j = 1, 2, \ldots, m$.
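
A distance matrix of this kind can be assembled as in the following sketch. The helper takes any symmetric distance function as a parameter; the names `distance_matrix` and `check_properties`, the single-element samples, and the unweighted Hamming distance used in the demonstration are assumptions made for illustration only.

```python
from typing import Callable, List, Sequence, TypeVar

S = TypeVar("S")


def distance_matrix(samples: Sequence[S],
                    dist: Callable[[S, S], float]) -> List[List[float]]:
    """Build D = (d_ij) for m samples, computing only the upper triangle."""
    m = len(samples)
    d = [[0.0] * m for _ in range(m)]
    for i in range(m):
        for j in range(i + 1, m):
            # Symmetry (property 3) lets both entries share one evaluation.
            d[i][j] = d[j][i] = dist(samples[i], samples[j])
    return d


def check_properties(d: List[List[float]], tol: float = 1e-12) -> bool:
    """Check the range [0, 1], symmetry, and a zero diagonal."""
    m = len(d)
    in_range_and_symmetric = all(
        -tol <= d[i][j] <= 1.0 + tol and abs(d[i][j] - d[j][i]) <= tol
        for i in range(m) for j in range(m)
    )
    zero_diagonal = all(abs(d[i][i]) <= tol for i in range(m))
    return in_range_and_symmetric and zero_diagonal


# Hypothetical samples: SVNSs over a single element, given as (T, I, F).
samples = [(1.0, 0.0, 0.0), (0.0, 1.0, 1.0), (0.5, 0.5, 0.5)]


def hamming(a, b):
    """Unweighted Hamming distance for one element (lambda = 1, w = 1)."""
    return sum(abs(p - q) for p, q in zip(a, b)) / 3.0


D = distance_matrix(samples, hamming)
print(D, check_properties(D))
```

The check covers only the diagonal half of property 2; distinct samples having nonzero distance follows from the distance function itself.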


4 SVNMST Clustering Algorithm 


In this section, an SVNMST clustering algorithm is proposed as a generalization of the IFMST clustering
algorithm [15].

Let $X = \{x_1, x_2, \ldots, x_n\}$ be an attribute space and let the weight vector of the elements $x_i$ ($i = 1, 2, \ldots, n$) be
$w = (w_1, w_2, \ldots, w_n)$, with $w_i \ge 0$ ($i = 1, 2, \ldots, n$) and $\sum_{i=1}^{n} w_i = 1$. Assume that $A_j$ ($j = 1, 2, \ldots, m$) is a
collection of m SVNSs, which express m samples to be clustered. Then, they have the following form:

$$A_j = \{\langle x_i, T_{A_j}(x_i), I_{A_j}(x_i), F_{A_j}(x_i)\rangle \mid x_i \in X\}.$$


Thus, we propose an SVNMST clustering algorithm, which is described by the following steps:
Step 1. Calculate the distances $d_{ij} = d_\lambda(A_i, A_j)$ ($i, j = 1, 2, \ldots, m$) by Equation (1) to establish the single-valued
    neutrosophic distance matrix $D = (d_{ij})_{m \times m}$.
Step 2. Draw the single-valued neutrosophic graph G(V, E), where every edge between $A_i$ and $A_j$ is assigned
    the weight (single-valued neutrosophic distance) $d_{ij}$, taken from the corresponding element of the
    single-valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$, which represents the dissimilarity degree
    between samples $A_i$ and $A_j$.
Step 3. Build the MST of the single-valued neutrosophic graph G(V, E) by Kruskal's method [5] or Prim's
    method [6]:
    1. Sort the edges of G in increasing order by weights.
    2. Keep a subgraph S of G, which is initially empty, and at each step, add to S the edge e with the
       smallest weight among those whose end points are not yet connected in S.
    3. Repeat process (2) until the subgraph S spans all vertices. Hence, the MST of the single-valued
       neutrosophic graph G(V, E) is obtained.


Step 4. Perform clustering by use of the SVNMST. We can get a certain number of subtrees (clusters) by
    disconnecting all the edges of the MST with weights greater than a threshold r. The clustering results
    induced by the subtrees do not depend on which particular MST is chosen [14, 15].
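
Steps 3 and 4 can be sketched in Python as follows. This is an illustration rather than the article's implementation: it uses Prim's method on the dense distance matrix, then removes every tree edge heavier than the threshold r and collects the connected components. The function names and the 4 x 4 matrix are hypothetical.

```python
from typing import List, Tuple

Matrix = List[List[float]]
TreeEdge = Tuple[int, int, float]  # (node, node, weight)


def prim_mst(d: Matrix) -> List[TreeEdge]:
    """Step 3: MST of the complete graph whose edge weights are d[i][j]."""
    m = len(d)
    in_tree = [False] * m
    best = [float("inf")] * m   # cheapest known link from the tree to node j
    link = [-1] * m             # tree node that realizes best[j]
    best[0] = 0.0               # start from node 0
    edges: List[TreeEdge] = []
    for _ in range(m):
        u = min((j for j in range(m) if not in_tree[j]), key=lambda j: best[j])
        in_tree[u] = True
        if link[u] >= 0:
            edges.append((link[u], u, d[link[u]][u]))
        for v in range(m):
            if not in_tree[v] and d[u][v] < best[v]:
                best[v], link[v] = d[u][v], u
    return edges


def cut_clusters(m: int, mst: List[TreeEdge], r: float) -> List[List[int]]:
    """Step 4: drop MST edges with weight > r; return connected components."""
    adj: List[List[int]] = [[] for _ in range(m)]
    for i, j, w in mst:
        if w <= r:
            adj[i].append(j)
            adj[j].append(i)
    seen = [False] * m
    clusters: List[List[int]] = []
    for s in range(m):
        if seen[s]:
            continue
        seen[s] = True
        stack, comp = [s], []
        while stack:                 # depth-first search from s
            u = stack.pop()
            comp.append(u)
            for v in adj[u]:
                if not seen[v]:
                    seen[v] = True
                    stack.append(v)
        clusters.append(sorted(comp))
    return clusters


# Hypothetical distance matrix for four samples (indices 0..3).
toy = [[0.0, 0.1, 0.5, 0.6],
       [0.1, 0.0, 0.4, 0.7],
       [0.5, 0.4, 0.0, 0.2],
       [0.6, 0.7, 0.2, 0.0]]

tree = prim_mst(toy)
print(tree)
print(cut_clusters(len(toy), tree, r=0.3))
```

Lowering r splits the tree into more subtrees; at r = 0 every sample forms its own cluster, and at any r no smaller than the heaviest tree edge all samples fall into one cluster.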


5 Illustrative Examples 


In this section, two illustrative examples are presented to demonstrate the real applications and the effective- 
ness of the proposed approach. 


Example 1. A car market is going to classify eight different cars $A_j$ ($j = 1, 2, \ldots, 8$). Every car has six
evaluation factors (attributes): (1) $x_1$: fuel consumption; (2) $x_2$: coefficient of friction; (3) $x_3$: price;
(4) $x_4$: degree of comfort; (5) $x_5$: design; (6) $x_6$: security coefficient. The characteristics of each car under
the six attributes are represented in the form of SVNSs, and the single-valued neutrosophic data are as follows:

A_1 = {<x_1, 0.3, 0.2, 0.5>, <x_2, 0.6, 0.3, 0.1>, <x_3, 0.4, 0.3, 0.3>, <x_4, 0.8, 0.1, 0.1>, <x_5, 0.1, 0.3, 0.6>,
<x_6, 0.5, 0.2, 0.4>};

A_2 = {<x_1, 0.6, 0.3, 0.3>, <x_2, 0.5, 0.4, 0.2>, <x_3, 0.6, 0.2, 0.1>, <x_4, 0.7, 0.2, 0.1>, <x_5, 0.3, 0.1, 0.6>,
<x_6, 0.4, 0.3, 0.3>};

A_3 = {<x_1, 0.4, 0.2, 0.4>, <x_2, 0.8, 0.2, 0.1>, <x_3, 0.5, 0.3, 0.1>, <x_4, 0.6, 0.1, 0.2>, <x_5, 0.4, 0.1, 0.5>,
<x_6, 0.3, 0.2, 0.2>};

A_4 = {<x_1, 0.2, 0.4, 0.4>, <x_2, 0.4, 0.5, 0.1>, <x_3, 0.9, 0.2, 0.0>, <x_4, 0.8, 0.2, 0.1>, <x_5, 0.2, 0.3, 0.5>,
<x_6, 0.7, 0.3, 0.1>};

A_5 = {<x_1, 0.2, 0.3, 0.3>, <x_2, 0.3, 0.2, 0.6>, <x_3, 0.5, 0.1, 0.4>, <x_4, 0.7, 0.1, 0.1>, <x_5, 0.4, 0.2, 0.4>,
<x_6, 0.3, 0.2, 0.6>};

A_6 = {<x_1, 0.3, 0.2, 0.4>, <x_2, 0.2, 0.1, 0.7>, <x_3, 0.4, 0.2, 0.5>, <x_4, 0.8, 0.0, 0.1>, <x_5, 0.4, 0.3, 0.5>,
<x_6, 0.2, 0.1, 0.7>};

A_7 = {<x_1, 0.4, 0.4, 0.3>, <x_2, 0.5, 0.3, 0.1>, <x_3, 0.6, 0.1, 0.2>, <x_4, 0.2, 0.3, 0.7>, <x_5, 0.3, 0.1, 0.5>,
<x_6, 0.7, 0.2, 0.1>};

A_8 = {<x_1, 0.4, 0.1, 0.2>, <x_2, 0.6, 0.1, 0.1>, <x_3, 0.8, 0.2, 0.1>, <x_4, 0.7, 0.2, 0.1>, <x_5, 0.1, 0.1, 0.8>,
<x_6, 0.2, 0.1, 0.8>}.

If the weight vector of the attributes $x_i$ ($i = 1, 2, \ldots, 6$) is $w = (0.16, 0.12, 0.25, 0.2, 0.15, 0.12)^T$, then we utilize the
SVNMST clustering algorithm to group the eight different cars $A_j$ ($j = 1, 2, \ldots, 8$).
Step 1. Calculate the distance $d_{ij} = d_\lambda(A_i, A_j)$ by Equation (1) (take $\lambda = 2$). Then we can establish the
    single-valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$ as follows:

$$D = \begin{bmatrix}
0.0000 & 0.2737 & 0.2551 & 0.3583 & 0.4022 & 0.4559 & 0.5052 & 0.4100 \\
0.2737 & 0.0000 & 0.2269 & 0.2605 & 0.4064 & 0.5334 & 0.4271 & 0.4117 \\
0.2551 & 0.2269 & 0.0000 & 0.3311 & 0.4577 & 0.5527 & 0.3983 & 0.4450 \\
0.3583 & 0.2605 & 0.3311 & 0.0000 & 0.5442 & 0.6720 & 0.4559 & 0.5794 \\
0.4022 & 0.4064 & 0.4577 & 0.5442 & 0.0000 & 0.1987 & 0.5957 & 0.4811 \\
0.4559 & 0.5334 & 0.5527 & 0.6720 & 0.1987 & 0.0000 & 0.7155 & 0.5248 \\
0.5052 & 0.4271 & 0.3983 & 0.4559 & 0.5957 & 0.7155 & 0.0000 & 0.6370 \\
0.4100 & 0.4117 & 0.4450 & 0.5794 & 0.4811 & 0.5248 & 0.6370 & 0.0000
\end{bmatrix}$$


Step 2. Draw the single-valued neutrosophic graph G(V, E), where every edge between $A_i$ and $A_j$ ($i, j = 1, 2, \ldots, 8$)
    is assigned the weight (single-valued neutrosophic distance) $d_{ij}$, taken from the corresponding element of the
    single-valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$, which represents the dissimilarity degree between
    the samples $A_i$ and $A_j$. The single-valued neutrosophic graph G(V, E) is shown in Figure 4.

Step 3. Establish the MST of the single-valued neutrosophic graph G(V, E) by Kruskal's method [5] or Prim's
    method [6]:

    1. Sort the edges of G in increasing order by weights:
       $d_{56} < d_{23} < d_{13} < d_{24} < d_{12} < d_{34} < d_{14} < d_{37} < d_{15} < d_{25} < d_{18} < d_{28} < d_{27} < d_{38} < d_{16} = d_{47} < d_{35} < d_{58} < d_{17} < d_{68} < d_{26} < d_{45} < d_{36} < d_{48} < d_{57} < d_{78} < d_{46} < d_{67}$.

    2. Keep an empty subgraph S of G and add the edge e with the smallest weight to S, where the end
       points of e are disconnected; thus, we choose the edge $e_{56}$ between $A_5$ and $A_6$.

    3. Repeat process (2) until the subgraph S spans eight nodes. Thus, the MST of the single-valued
       neutrosophic graph G(V, E) is obtained, as shown in Figure 5.


Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than r to obtain a 
certain number of subtrees (clusters), as listed in Table 1. 
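
The threshold sweep of Step 4 can be replayed on the distance matrix given in Step 1. The sketch below is illustrative: it copies the matrix values from the text, builds the MST with a Kruskal-style union-find pass, and prints the partition obtained at each MST edge weight. The helper names `find`, `kruskal`, and `clusters_at` are hypothetical, and car labels are printed 1-based.

```python
from itertools import combinations

# Distance matrix D of Example 1, Step 1 (values copied from the text).
D = [
    [0.0000, 0.2737, 0.2551, 0.3583, 0.4022, 0.4559, 0.5052, 0.4100],
    [0.2737, 0.0000, 0.2269, 0.2605, 0.4064, 0.5334, 0.4271, 0.4117],
    [0.2551, 0.2269, 0.0000, 0.3311, 0.4577, 0.5527, 0.3983, 0.4450],
    [0.3583, 0.2605, 0.3311, 0.0000, 0.5442, 0.6720, 0.4559, 0.5794],
    [0.4022, 0.4064, 0.4577, 0.5442, 0.0000, 0.1987, 0.5957, 0.4811],
    [0.4559, 0.5334, 0.5527, 0.6720, 0.1987, 0.0000, 0.7155, 0.5248],
    [0.5052, 0.4271, 0.3983, 0.4559, 0.5957, 0.7155, 0.0000, 0.6370],
    [0.4100, 0.4117, 0.4450, 0.5794, 0.4811, 0.5248, 0.6370, 0.0000],
]


def find(parent, v):
    """Return the representative of v's component, shortening the path."""
    while parent[v] != v:
        parent[v] = parent[parent[v]]
        v = parent[v]
    return v


def kruskal(d):
    """MST edges (weight, i, j) of the complete graph on the samples."""
    m = len(d)
    parent = list(range(m))
    tree = []
    for w, i, j in sorted((d[i][j], i, j) for i, j in combinations(range(m), 2)):
        ri, rj = find(parent, i), find(parent, j)
        if ri != rj:             # end points not yet connected
            parent[ri] = rj
            tree.append((w, i, j))
    return tree


def clusters_at(m, tree, r):
    """Clusters left after disconnecting MST edges with weight > r."""
    parent = list(range(m))
    for w, i, j in tree:
        if w <= r:
            ri, rj = find(parent, i), find(parent, j)
            parent[ri] = rj
    groups = {}
    for v in range(m):
        groups.setdefault(find(parent, v), []).append(v + 1)  # 1-based labels
    return sorted(groups.values())


tree = kruskal(D)
for w, i, j in sorted(tree, reverse=True):
    print(f"r = d_{i + 1}{j + 1} = {w:.4f}:", clusters_at(len(D), tree, w))
```

Under these assumptions the MST consists of the edges between cars (5, 6), (2, 3), (1, 3), (2, 4), (3, 7), (1, 5), and (1, 8), and the threshold r = 0.2605, for example, leaves the clusters {A_1, A_2, A_3, A_4}, {A_5, A_6}, {A_7}, {A_8}.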


Figure 4. Single-Valued Neutrosophic Graph G(V, E) with the Eight Nodes.


Figure 5. The MST of the Single-Valued Neutrosophic Graph G(V, E) with the Eight Nodes.


To compare the SVNMST clustering algorithm with the intuitionistic fuzzy MST clustering algorithm and the
fuzzy MST clustering algorithm, we consider the following example, which is discussed in [14, 15].


Example 2. To complete an operational mission, six sets of operational plans are made (adapted from
[14, 15]). To group these operational plans with respect to their comprehensive function, a military committee
has been set up to provide assessment information on them. The attributes considered in the
assessment of the six operational plans $A_j$ ($j = 1, 2, \ldots, 6$) are (1) $x_1$, the effectiveness of operational organization,
and (2) $x_2$, the effectiveness of operational command. The weight vector of the attributes $x_i$ ($i = 1, 2$) is
$w = (0.45, 0.55)^T$. The military committee evaluates the performance of the six operational plans $A_j$ ($j = 1, 2, \ldots, 6$)
with respect to the attributes $x_i$ ($i = 1, 2$) and gives the SVNSs as follows:


A_1 = {<x_1, 0.7, 0.2, 0.15>, <x_2, 0.6, 0.3, 0.2>};
A_2 = {<x_1, 0.4, 0.3, 0.35>, <x_2, 0.8, 0.1, 0.1>};
A_3 = {<x_1, 0.55, 0.2, 0.25>, <x_2, 0.7, 0.1, 0.15>};
A_4 = {<x_1, 0.44, 0.2, 0.35>, <x_2, 0.6, 0.2, 0.2>};
A_5 = {<x_1, 0.5, 0.15, 0.35>, <x_2, 0.75, 0.1, 0.2>};
A_6 = {<x_1, 0.55, 0.2, 0.25>, <x_2, 0.57, 0.2, 0.15>}.


Then we employ the SVNMST clustering algorithm to group these operational plans $A_j$ ($j = 1, 2, \ldots, 6$).
Step 1. Calculate the distance $d_{ij} = d_\lambda(A_i, A_j)$ by Equation (1) (take $\lambda = 2$). Then we can establish the
    single-valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$ as follows:

$$D = \begin{bmatrix}
0.0000 & 0.2327 & 0.1507 & 0.1637 & 0.1940 & 0.1051 \\
0.2327 & 0.0000 & 0.1127 & 0.1326 & 0.1131 & 0.1546 \\
0.1507 & 0.1127 & 0.0000 & 0.1056 & 0.0764 & 0.0802 \\
0.1637 & 0.1326 & 0.1056 & 0.0000 & 0.0940 & 0.0784 \\
0.1940 & 0.1131 & 0.0764 & 0.0940 & 0.0000 & 0.1210 \\
0.1051 & 0.1546 & 0.0802 & 0.0784 & 0.1210 & 0.0000
\end{bmatrix}$$


Table 1. SVNMST Clustering Results of the Eight Different Cars.

r                      Corresponding clustering result
r = d_{18} = 0.41      {A_1, A_2, A_3, A_4, A_5, A_6, A_7, A_8}
r = d_{15} = 0.4022    {A_1, A_2, A_3, A_4, A_5, A_6, A_7}, {A_8}
r = d_{37} = 0.3983    {A_1, A_2, A_3, A_4, A_7}, {A_5, A_6}, {A_8}
r = d_{24} = 0.2605    {A_1, A_2, A_3, A_4}, {A_5, A_6}, {A_7}, {A_8}
r = d_{13} = 0.2551    {A_1, A_2, A_3}, {A_4}, {A_5, A_6}, {A_7}, {A_8}
r = d_{23} = 0.2269    {A_1}, {A_2, A_3}, {A_4}, {A_5, A_6}, {A_7}, {A_8}
r = d_{56} = 0.1987    {A_1}, {A_2}, {A_3}, {A_4}, {A_5, A_6}, {A_7}, {A_8}
r = 0                  {A_1}, {A_2}, {A_3}, {A_4}, {A_5}, {A_6}, {A_7}, {A_8}
Step 2. Draw the single-valued neutrosophic graph G(V, E), where every edge between $A_i$ and $A_j$ ($i, j = 1, 2, \ldots, 6$)
    is assigned the weight (single-valued neutrosophic distance) $d_{ij}$, taken from the corresponding element of the
    single-valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$, which represents the dissimilarity degree between
    the samples $A_i$ and $A_j$. The single-valued neutrosophic graph G(V, E) is shown in Figure 6.

Step 3. Build the MST of the single-valued neutrosophic graph G(V, E) by Kruskal's method [5] or Prim's
    method [6].

    1. Sort the edges of G in increasing order by weights:

       $d_{35} < d_{46} < d_{36} < d_{45} < d_{16} < d_{34} < d_{23} < d_{25} < d_{56} < d_{24} < d_{13} < d_{26} < d_{14} < d_{15} < d_{12}$.

    2. Keep an empty subgraph S of G and add the edge e with the smallest weight to S, where the end
       points of e are disconnected; thus, we choose the edge $e_{35}$ between $A_3$ and $A_5$.

    3. Repeat process (2) until the subgraph S spans six nodes. Thus, the MST of the single-valued
       neutrosophic graph G(V, E) is obtained, as shown in Figure 7.


Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than r to obtain a
    certain number of subtrees (clusters), as listed in Table 2.


Figure 6. Single-Valued Neutrosophic Graph G(V, E) with the Six Nodes.


Figure 7. The MST of the Single-Valued Neutrosophic Graph G(V, E) with the Six Nodes.


Table 2. SVNMST Clustering Results of the Six Operational Plans.

r                      Corresponding clustering result
r = d_{23} = 0.1127    {A_1, A_2, A_3, A_4, A_5, A_6}
r = d_{16} = 0.1051    {A_2}, {A_1, A_3, A_4, A_5, A_6}
r = d_{36} = 0.0802    {A_1}, {A_2}, {A_3, A_4, A_5, A_6}
r = d_{46} = 0.0784    {A_1}, {A_2}, {A_3, A_5}, {A_4, A_6}
r = d_{35} = 0.0764    {A_1}, {A_2}, {A_3, A_5}, {A_4}, {A_6}
r = 0                  {A_1}, {A_2}, {A_3}, {A_4}, {A_5}, {A_6}


To compare the SVNMST clustering algorithm with the IFMST clustering algorithm in [15], assume that the
indeterminacy membership function $I_{A_j}(x_i)$ is not considered independently in an SVNS $A_j$; then the samples'
information given by the military committee becomes the intuitionistic fuzzy data (adopted in [15]):


A_1 = {<x_1, 0.7, 0.15>, <x_2, 0.6, 0.2>}, A_2 = {<x_1, 0.4, 0.35>, <x_2, 0.8, 0.1>};

A_3 = {<x_1, 0.55, 0.25>, <x_2, 0.7, 0.15>}, A_4 = {<x_1, 0.44, 0.35>, <x_2, 0.6, 0.2>};

A_5 = {<x_1, 0.5, 0.35>, <x_2, 0.75, 0.2>}, A_6 = {<x_1, 0.55, 0.25>, <x_2, 0.57, 0.15>}.


Then, the operational plans $A_j$ ($j = 1, 2, \ldots, 6$) can be clustered by the following IFMST clustering algorithm
[15].
Step 1. Calculate $d_{ij} = d(A_i, A_j)$ by the intuitionistic fuzzy distance measure:


$$d(A_i, A_j) = \left[ \frac{1}{2} \sum_{i=1}^{n} w_i \left( (\mu_{A_i}(x_i) - \mu_{A_j}(x_i))^2 + (\nu_{A_i}(x_i) - \nu_{A_j}(x_i))^2 + (\pi_{A_i}(x_i) - \pi_{A_j}(x_i))^2 \right) \right]^{1/2}. \tag{4}$$


Thus, we get the intuitionistic fuzzy distance matrix:


$$D = \begin{bmatrix}
0.0000 & 0.2450 & 0.1225 & 0.1170 & 0.1725 & 0.1115 \\
0.2450 & 0.0000 & 0.1225 & 0.1280 & 0.1000 & 0.1940 \\
0.1225 & 0.1225 & 0.0000 & 0.1045 & 0.1000 & 0.0715 \\
0.1170 & 0.1280 & 0.1045 & 0.0000 & 0.1095 & 0.0935 \\
0.1725 & 0.1000 & 0.1000 & 0.1095 & 0.0000 & 0.1715 \\
0.1115 & 0.1940 & 0.0715 & 0.0935 & 0.1715 & 0.0000
\end{bmatrix}$$


Step 2. Construct the intuitionistic fuzzy graph G = (V, E); see Step 2 in the SVNMST clustering algorithm
    and Figure 6.
Step 3. Get the MST of the intuitionistic fuzzy graph G = (V, E) by Kruskal's method [5] or Prim's method [6]:
    1. Sort the edges of G in increasing order by weights:
       $d_{36} < d_{46} < d_{25} = d_{35} < d_{34} < d_{45} < d_{16} < d_{14} < d_{13} = d_{23} < d_{24} < d_{56} < d_{15} < d_{26} < d_{12}$.
    2. Keep an empty subgraph S of G and add the edge e with the smallest weight to S, where the end
       points of e are disconnected; thus, we choose the edge $e_{36}$ between $A_3$ and $A_6$.
    3. Repeat process (2) until the subgraph S spans six nodes. Thus, the MST of the intuitionistic fuzzy
       graph G(V, E) is obtained, as shown in Figure 8.


Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than r to obtain a
    certain number of subtrees (clusters), as listed in Table 3.


It is well known that a fuzzy set is composed only of the membership degree. Therefore, we consider only
the membership degrees of the intuitionistic fuzzy data, and the operational plan information is then given by the
following fuzzy data [15]:


A_1 = {<x_1, 0.7>, <x_2, 0.6>}, A_2 = {<x_1, 0.4>, <x_2, 0.8>}, A_3 = {<x_1, 0.55>, <x_2, 0.7>};

A_4 = {<x_1, 0.44>, <x_2, 0.6>}, A_5 = {<x_1, 0.5>, <x_2, 0.75>}, A_6 = {<x_1, 0.55>, <x_2, 0.57>}.


Figure 8. The MST of the Intuitionistic Fuzzy Graph G = (V, E).


Then, the operational plans $A_j$ ($j = 1, 2, \ldots, 6$) can be clustered by the fuzzy MST clustering algorithm:
Step 1. Calculate $d_{ij} = d(A_i, A_j)$ by the fuzzy distance measure:


$$d(A_i, A_j) = \left[ \sum_{i=1}^{n} w_i \left( \mu_{A_i}(x_i) - \mu_{A_j}(x_i) \right)^2 \right]^{1/2}. \tag{5}$$


Thus, we obtain the fuzzy distance matrix:


$$D = \begin{bmatrix}
0.0000 & 0.2500 & 0.1250 & 0.1744 & 0.1743 & 0.1031 \\
0.2500 & 0.0000 & 0.1250 & 0.1507 & 0.0766 & 0.1980 \\
0.1250 & 0.1250 & 0.0000 & 0.1046 & 0.0500 & 0.0964 \\
0.1744 & 0.1507 & 0.1046 & 0.0000 & 0.1183 & 0.0771 \\
0.1743 & 0.0766 & 0.0500 & 0.1183 & 0.0000 & 0.1376 \\
0.1031 & 0.1980 & 0.0964 & 0.0771 & 0.1376 & 0.0000
\end{bmatrix}$$


Step 2. Draw the fuzzy graph G = (V, E). See also Step 2 in the SVNMST clustering algorithm and Figure 6.
Step 3. Build the MST of the fuzzy graph G = (V, E) by Kruskal's method [5] or Prim's method [6]:
    1. Sort the edges of G in increasing order by weights:
       $d_{35} < d_{25} < d_{46} < d_{36} < d_{16} < d_{34} < d_{45} < d_{13} = d_{23} < d_{56} < d_{24} < d_{15} < d_{14} < d_{26} < d_{12}$.
    2. Keep an empty subgraph S of G and add the edge e with the smallest weight to S, where the end
       points of e are disconnected; thus, we choose the edge $e_{35}$ between $A_3$ and $A_5$.
    3. Repeat process (2) until the subgraph S spans six nodes. Thus, the MST of the fuzzy graph G(V, E)
       is obtained, as shown in Figure 9.


Table 3. IFMST Clustering Results of the Six Operational Plans.

r                             Corresponding clustering result
r = d_{16} = 0.1115           {A_1, A_2, A_3, A_4, A_5, A_6}
r = d_{25} = d_{35} = 0.1     {A_1}, {A_2, A_3, A_4, A_5, A_6}
r = d_{46} = 0.088            {A_1}, {A_2}, {A_5}, {A_3, A_4, A_6}
r = d_{36} = 0.0715           {A_1}, {A_2}, {A_4}, {A_5}, {A_3, A_6}
r = 0                         {A_1}, {A_2}, {A_3}, {A_4}, {A_5}, {A_6}


Figure 9. The MST of the Fuzzy Graph G = (V, E).


Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than r to obtain a 
certain number of subtrees (clusters), as listed in Table 4. 


Table 4. Fuzzy MST Clustering Results.

r                      Corresponding clustering result
r = d_{16} = 0.1031    {A_1, A_2, A_3, A_4, A_5, A_6}
r = d_{36} = 0.0964    {A_1}, {A_2, A_3, A_4, A_5, A_6}
r = d_{46} = 0.0774    {A_1}, {A_2, A_3, A_5}, {A_4, A_6}
r = d_{25} = 0.0766    {A_1}, {A_2, A_3, A_5}, {A_4}, {A_6}
r = d_{35} = 0.0500    {A_1}, {A_2}, {A_3, A_5}, {A_4}, {A_6}
r = 0                  {A_1}, {A_2}, {A_3}, {A_4}, {A_5}, {A_6}


For convenient comparison, we put the clustering results of the three kinds of clustering algorithms into Table 5.


Table 5. Clustering Results of Three Kinds of Clustering Algorithms.

Class  SVNMST clustering algorithm               IFMST clustering algorithm                Fuzzy MST clustering algorithm
1      {A_1, A_2, A_3, A_4, A_5, A_6}            {A_1, A_2, A_3, A_4, A_5, A_6}            {A_1, A_2, A_3, A_4, A_5, A_6}
2      {A_2}, {A_1, A_3, A_4, A_5, A_6}          {A_1}, {A_2, A_3, A_4, A_5, A_6}          {A_1}, {A_2, A_3, A_4, A_5, A_6}
3      {A_1}, {A_2}, {A_3, A_4, A_5, A_6}        -                                         {A_1}, {A_2, A_3, A_5}, {A_4, A_6}
4      {A_1}, {A_2}, {A_3, A_5}, {A_4, A_6}      {A_1}, {A_2}, {A_5}, {A_3, A_4, A_6}      {A_1}, {A_2, A_3, A_5}, {A_4}, {A_6}
5      {A_1}, {A_2}, {A_3, A_5}, {A_4}, {A_6}    {A_1}, {A_2}, {A_4}, {A_5}, {A_3, A_6}    {A_1}, {A_2}, {A_3, A_5}, {A_4}, {A_6}
6      {A_1}, {A_2}, {A_3}, {A_4}, {A_5}, {A_6}  {A_1}, {A_2}, {A_3}, {A_4}, {A_5}, {A_6}  {A_1}, {A_2}, {A_3}, {A_4}, {A_5}, {A_6}


From Table 5, we can see that the clustering results of the three clustering algorithms are quite different.
The main reason can be given by the following comparative analysis.

It is well known that single-valued neutrosophic information is a generalization of intuitionistic fuzzy
information, and intuitionistic fuzzy information is in turn a generalization of fuzzy information. On the one
hand, an SVNS is an instance of a neutrosophic set, which gives us an additional possibility to represent
uncertain, imprecise, incomplete, and inconsistent information that exists in the real world. It can describe
and handle indeterminate information and inconsistent information. By contrast, the connector in the fuzzy set
is defined with respect to T, i.e., membership only; hence, the information on indeterminacy and
nonmembership is lost. The connectors in the intuitionistic fuzzy set are defined with respect to T and F, i.e.,
membership and nonmembership only; hence, the indeterminacy is what is left from 1, and the intuitionistic fuzzy
set can handle only incomplete information, not indeterminate or inconsistent information.
In the SVNS, truth membership, indeterminacy membership, and falsity membership are
represented independently, and the connectors can be defined with respect to any of them (no restriction).
Therefore, the notion of SVNSs is more general. On the other hand, clustering analysis under a single-valued
neutrosophic environment is suitable for capturing imprecise, uncertain, and inconsistent information in
clustering the data. Thus, the SVNMST clustering algorithm clusters single-valued neutrosophic
information, whereas the IFMST clustering algorithm clusters intuitionistic fuzzy information and the fuzzy
MST clustering algorithm clusters fuzzy information. The SVNMST clustering algorithm is therefore an
extension of both the IFMST clustering algorithm and the fuzzy MST clustering algorithm, and, compared
with these two algorithms, it is more general. Furthermore, in situations that are represented by
indeterminate information and inconsistent information, the SVNMST clustering algorithm shows its
advantage in clustering such single-valued neutrosophic data.


6 Conclusion 


This article defined a generalized single-valued neutrosophic weighted distance and proposed the SVNMST
clustering algorithm as a generalization of the IFMST clustering algorithm. In the computational tests
on the SVNMST clustering algorithm, the IFMST clustering algorithm, and the fuzzy MST clustering algorithm,
the clustering results have shown that the SVNMST clustering algorithm is more general and more
reasonable than the IFMST clustering algorithm and the fuzzy MST clustering algorithm. Furthermore, in situations
that are represented by indeterminate information and inconsistent information, the SVNMST clustering
algorithm shows its advantage in clustering such single-valued neutrosophic data because
SVNSs are a powerful tool to deal with uncertain, imprecise, incomplete, and inconsistent information. In
the future, the developed algorithm can be applied to many areas such as information retrieval, investment
decision making, and data mining.


Received September 25, 2013; previously published online January 8, 2014. 